Text-To-Visual Speech in Chinese Based on Data-Driven Approach
نویسنده
چکیده
Text-To-Visual speech (TTVS) synthesis by computer can increase the speech intelligibility and make the human-computer interaction interfaces more friendly. This paper describes a Chinese text-to-visual speech synthesis system based on data-driven (sample based) approach, which is realized by short video segments concatenation. An effective method to construct two visual confusion trees for Chinese initials and finals is developed. A co-articulation model based on visual distance and hardness factor is proposed, which can be used in the recording corpus sentence selection in analysis phase and the unit selection in synthesis phase. The obvious difference between boundary images of the concatenation video segments is smoothed by image morphing technique. By combining with the acoustic Text-To-Speech (TTS) synthesis, a Chinese text-to-visual speech synthesis system is realized.
منابع مشابه
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملComparing text-driven and speech-driven visual speech synthesisers
We present a comparison of a text-driven and a speech driven visual speech synthesiser. Both are trained using the same data and both use the same Active Appearance Model (AAM) to encode and re-synthesise visual speech. Objective quality, measured using correlation, suggests the performance of both approaches is close, but subjective opinion ranks the text-driven approach significantly higher.
متن کاملData Driven Approaches to Phonetic Transcription with Integration of Automatic Speech Recognition and Grapheme-to-Phoneme for Spoken Buddhist Sutra
We propose a new approach for performing phonetic transcription of text that utilizes automatic speech recognition (ASR) to help traditional grapheme-to-phoneme (G2P) techniques. This approach was applied to transcribe Chinese text into Taiwanese phonetic symbols. By augmenting the text with speech and using automatic speech recognition with a sausage searching net constructed from multiple pro...
متن کاملجُستاری در رویکرد دیالکتیکی به «خواندن»
Purpose: This article tries to explain that reading is a dialectical action. For this purpose, it refers to the concept of dialectics in ancient times and, with a glance at the concepts of man, world, science, language and knowledge, it tries to discuss the dialectical status of reading. Method: In the present article, a conceptual analysis approach has been used. This approach that is used i...
متن کاملComparative Approach to the Relationship Between Text and Hand Visual Language in Tahmasebi’s Shahnameh Pictures
The painters of Tahmasbi Shahnameh, in order to depict the text full of the story of Shahnameh, tried to convey emotions and excitement to the audience by using the visual language of the hand. Due to the multiplicity of applications of this type of nonverbal communication in different situations, the painter may have undergone changes in parts of her painting under the influence of various fac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005